Overview

Dataset statistics

Number of variables35
Number of observations2143
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory586.1 KiB
Average record size in memory280.1 B

Variable types

CAT17
BOOL9
NUM9

Reproduction

Analysis started2020-07-25 19:03:22.282475
Analysis finished2020-07-25 19:03:46.785952
Duration24.5 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

NOM_SITUACAO_PERIODO_ANT has constant value "Cursado" Constant
VAL_DIVIDA_TOTAL is highly correlated with VAL_DIVIDA_MENSHigh correlation
VAL_DIVIDA_MENS is highly correlated with VAL_DIVIDA_TOTALHigh correlation
QTD_ACESSOS_20_1 is highly correlated with QTD_ACESSOS_19_2High correlation
QTD_ACESSOS_19_2 is highly correlated with QTD_ACESSOS_20_1High correlation
CLASSE_PAGANTE_ATU is highly correlated with CLASSE_PAGANTE_ANTHigh correlation
CLASSE_PAGANTE_ANT is highly correlated with CLASSE_PAGANTE_ATUHigh correlation
COD_MATRICULA has unique values Unique
VAL_A_PAGAR has 663 (30.9%) zeros Zeros
VAL_A_PAGAR_PAR has 2011 (93.8%) zeros Zeros
VAL_DIVIDA_MENS has 1876 (87.5%) zeros Zeros
VAL_DIVIDA_TOTAL has 1728 (80.6%) zeros Zeros
CR_PER_ANT has 138 (6.4%) zeros Zeros

Variables

COD_MATRICULA
Real number (ℝ≥0)

UNIQUE

Distinct count2143
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201729628317.4839
Minimum200701339023
Maximum201912030951
Zeros0
Zeros (%)0.0%
Memory size16.7 KiB

Quantile statistics

Minimum2.00701339e+11
5-th percentile2.01403269e+11
Q12.016030709e+11
median2.018010849e+11
Q32.019012712e+11
95-th percentile2.019085073e+11
Maximum2.01912031e+11
Range1210691928
Interquartile range (IQR)298200276.5

Descriptive statistics

Standard deviation172250934.4
Coefficient of variation (CV)0.0008538702811
Kurtosis3.123661373
Mean2.017296283e+11
Median Absolute Deviation (MAD)101115048
Skewness-1.354607483
Sum4.323065935e+14
Variance2.967038441e+16
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2.017090416e+111< 0.1%
 
2.017081645e+111< 0.1%
 
2.018023909e+111< 0.1%
 
2.019022219e+111< 0.1%
 
2.016074766e+111< 0.1%
 
2.019085924e+111< 0.1%
 
2.019086619e+111< 0.1%
 
2.01902022e+111< 0.1%
 
2.019085767e+111< 0.1%
 
2.017010963e+111< 0.1%
 
Other values (2133)213399.5%
 
ValueCountFrequency (%) 
2.00701339e+111< 0.1%
 
2.007021982e+111< 0.1%
 
2.007021987e+111< 0.1%
 
2.00801431e+111< 0.1%
 
2.009014022e+111< 0.1%
 
ValueCountFrequency (%) 
2.01912031e+111< 0.1%
 
2.019120309e+111< 0.1%
 
2.019120309e+111< 0.1%
 
2.019120309e+111< 0.1%
 
2.019120308e+111< 0.1%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
1
1736
0
407
ValueCountFrequency (%) 
1173681.0%
 
040719.0%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
1
1748
0
395
ValueCountFrequency (%) 
1174881.6%
 
039518.4%
 
Distinct count8
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
1 - ALUNO RENOVADO
1774
9 - ALTO PROPENSO
 
104
8 - BAIXO PROPENSO
 
102
7- BAIXO APROVEITAMENTO
 
98
2 - ALUNO EVADIDO
 
33
Other values (3)
 
32
ValueCountFrequency (%) 
1 - ALUNO RENOVADO177482.8%
 
9 - ALTO PROPENSO1044.9%
 
8 - BAIXO PROPENSO1024.8%
 
7- BAIXO APROVEITAMENTO984.6%
 
2 - ALUNO EVADIDO331.5%
 
3 - PEDIDO DE TRANCAMENTO221.0%
 
4 - EM ENTURMACAO70.3%
 
6 - ACEITE DE CONTRATO30.1%
 

Length

Max length25
Median length18
Mean length18.23891741
Min length17
Distinct count9
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
ALUNO RENOVADO
1774
INADIMPLENTE
 
195
BOLETO RENOVACAO NAO PAGO
 
97
ALUNO EVADIDO SEM RENOVAR
 
33
PEDIDO DE TRANCAMENTO
 
22
Other values (4)
 
22
ValueCountFrequency (%) 
ALUNO RENOVADO177482.8%
 
INADIMPLENTE1959.1%
 
BOLETO RENOVACAO NAO PAGO974.5%
 
ALUNO EVADIDO SEM RENOVAR331.5%
 
PEDIDO DE TRANCAMENTO221.0%
 
BOLETO RENOVACAO NAO GERADO120.6%
 
GAP ENTURMACAO40.2%
 
DELAY SISTEMICO (ENTURMADO)30.1%
 
ACEITE DE CONTRATO30.1%
 

Length

Max length27
Median length14
Mean length14.65375642
Min length12
Distinct count3
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
ADIMPLENTE
1789
INADIMPLENTE COM NEGOCIACAO
 
191
INADIMPLENTE SEM NEGOCIACAO
 
163
ValueCountFrequency (%) 
ADIMPLENTE178983.5%
 
INADIMPLENTE COM NEGOCIACAO1918.9%
 
INADIMPLENTE SEM NEGOCIACAO1637.6%
 

Length

Max length27
Median length10
Mean length12.80821279
Min length10
Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
SEM RISCO
895
BAIXO RISCO
847
MEDIO RISCO
254
ALTO RISCO
 
147
ValueCountFrequency (%) 
SEM RISCO89541.8%
 
BAIXO RISCO84739.5%
 
MEDIO RISCO25411.9%
 
ALTO RISCO1476.9%
 

Length

Max length11
Median length11
Mean length10.09612692
Min length9

COD_CURSO
Real number (ℝ≥0)

Distinct count13
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean792.4405039664023
Minimum1
Maximum4070
Zeros0
Zeros (%)0.0%
Memory size16.7 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median29
Q3372
95-th percentile4004
Maximum4070
Range4069
Interquartile range (IQR)370

Descriptive statistics

Standard deviation1492.372055
Coefficient of variation (CV)1.883260696
Kurtosis0.871889804
Mean792.440504
Median Absolute Deviation (MAD)28
Skewness1.681330267
Sum1698200
Variance2227174.351
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
143420.3%
 
2934416.1%
 
37229413.7%
 
16621510.0%
 
21728.0%
 
40041456.8%
 
2071446.7%
 
40021054.9%
 
21884.1%
 
11743.5%
 
Other values (3)1286.0%
 
ValueCountFrequency (%) 
143420.3%
 
21728.0%
 
11743.5%
 
21884.1%
 
2934416.1%
 
ValueCountFrequency (%) 
4070703.3%
 
40041456.8%
 
4003562.6%
 
40021054.9%
 
37229413.7%
 

COD_TURNO
Categorical

Distinct count3
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
3
2108
1
 
34
8
 
1
ValueCountFrequency (%) 
3210898.4%
 
1341.6%
 
81< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

COD_TIPO_CURSO
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
11
2073
4
 
70
ValueCountFrequency (%) 
11207396.7%
 
4703.3%
 

Length

Max length2
Median length2
Mean length1.967335511
Min length1

CLASSE_PAGANTE_ANT
Categorical

HIGH CORRELATION

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
MENSALISTA
1463
FIES
303
PROUNI
 
198
PAR
 
179
ValueCountFrequency (%) 
MENSALISTA146368.3%
 
FIES30314.1%
 
PROUNI1989.2%
 
PAR1798.4%
 

Length

Max length10
Median length10
Mean length8.197386841
Min length3

CLASSE_PAGANTE_ATU
Categorical

HIGH CORRELATION

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
MENSALISTA
1519
FIES
 
251
PROUNI
 
194
PAR
 
179
ValueCountFrequency (%) 
MENSALISTA151970.9%
 
FIES25111.7%
 
PROUNI1949.1%
 
PAR1798.4%
 

Length

Max length10
Median length10
Mean length8.350443304
Min length3

NOVO_FIES
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
NÃO
2078
SIM
 
65
ValueCountFrequency (%) 
NÃO207897.0%
 
SIM653.0%
 

Length

Max length3
Median length3
Mean length3
Min length3

PRV_ANT
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2064
1
 
79
ValueCountFrequency (%) 
0206496.3%
 
1793.7%
 

PRV_ATU
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2090
1
 
53
ValueCountFrequency (%) 
0209097.5%
 
1532.5%
 

LATE_COMER
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
ON TIME COMER
2105
LATE COMER
 
38
ValueCountFrequency (%) 
ON TIME COMER210598.2%
 
LATE COMER381.8%
 

Length

Max length13
Median length13
Mean length12.94680355
Min length10
Distinct count3
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
Ativo
2081
Transferido
 
36
Trancado
 
26
ValueCountFrequency (%) 
Ativo208197.1%
 
Transferido361.7%
 
Trancado261.2%
 

Length

Max length11
Median length5
Mean length5.137190854
Min length5

NOM_SITUACAO_PERIODO_ANT
Categorical

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
Cursado
2143
ValueCountFrequency (%) 
Cursado2143100.0%
 

Length

Max length7
Median length7
Mean length7
Min length7

VAL_A_PAGAR
Real number (ℝ≥0)

ZEROS

Distinct count1260
Unique (%)58.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean525.4301539897341
Minimum0.0
Maximum3210.35
Zeros663
Zeros (%)30.9%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median445.18
Q3866.935
95-th percentile1504.16
Maximum3210.35
Range3210.35
Interquartile range (IQR)866.935

Descriptive statistics

Standard deviation532.3464468
Coefficient of variation (CV)1.013163106
Kurtosis0.3996551506
Mean525.430154
Median Absolute Deviation (MAD)445.18
Skewness0.9029764241
Sum1125996.82
Variance283392.7394
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
066330.9%
 
1486.8280.4%
 
1489.5470.3%
 
1927.6460.3%
 
1109.9360.3%
 
1134.7760.3%
 
1612.4560.3%
 
1047.4850.2%
 
1504.1650.2%
 
827.9150.2%
 
Other values (1250)142666.5%
 
ValueCountFrequency (%) 
066330.9%
 
6.481< 0.1%
 
7.591< 0.1%
 
12.471< 0.1%
 
12.71< 0.1%
 
ValueCountFrequency (%) 
3210.351< 0.1%
 
2791.81< 0.1%
 
2521.121< 0.1%
 
2294.71< 0.1%
 
2288.111< 0.1%
 

VAL_A_PAGAR_PAR
Real number (ℝ≥0)

ZEROS

Distinct count109
Unique (%)5.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.624316378908071
Minimum0.0
Maximum1185.36
Zeros2011
Zeros (%)93.8%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile4.54
Maximum1185.36
Range1185.36
Interquartile range (IQR)0

Descriptive statistics

Standard deviation111.9038394
Coefficient of variation (CV)7.16215908
Kurtosis65.21248557
Mean15.62431638
Median Absolute Deviation (MAD)0
Skewness7.921151567
Sum33482.91
Variance12522.46928
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0201193.8%
 
4.9150.2%
 
4.2530.1%
 
4.2230.1%
 
4.5430.1%
 
14.1530.1%
 
4.3930.1%
 
3.9320.1%
 
4.720.1%
 
9.9820.1%
 
Other values (99)1064.9%
 
ValueCountFrequency (%) 
0201193.8%
 
2.611< 0.1%
 
2.981< 0.1%
 
3.351< 0.1%
 
3.471< 0.1%
 
ValueCountFrequency (%) 
1185.361< 0.1%
 
1160.811< 0.1%
 
1156.581< 0.1%
 
1152.231< 0.1%
 
1139.541< 0.1%
 

VAL_DIVIDA_MENS
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count232
Unique (%)10.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean300.6963695753617
Minimum0.0
Maximum15131.22
Zeros1876
Zeros (%)87.5%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2322.319
Maximum15131.22
Range15131.22
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1228.325304
Coefficient of variation (CV)4.084935597
Kurtosis41.37441251
Mean300.6963696
Median Absolute Deviation (MAD)0
Skewness5.808100793
Sum644392.32
Variance1508783.052
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0187687.5%
 
20281.3%
 
4070.3%
 
2520.1%
 
1219.3820.1%
 
8020.1%
 
11183.871< 0.1%
 
1963.031< 0.1%
 
1890.421< 0.1%
 
8963.031< 0.1%
 
Other values (222)22210.4%
 
ValueCountFrequency (%) 
0187687.5%
 
9.81< 0.1%
 
10.541< 0.1%
 
19.191< 0.1%
 
20281.3%
 
ValueCountFrequency (%) 
15131.221< 0.1%
 
12473.741< 0.1%
 
11935.041< 0.1%
 
11722.091< 0.1%
 
11183.871< 0.1%
 

VAL_DIVIDA_TOTAL
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count380
Unique (%)17.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean401.22972001866543
Minimum0.0
Maximum15370.45
Zeros1728
Zeros (%)80.6%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2815.67
Maximum15370.45
Range15370.45
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1381.496052
Coefficient of variation (CV)3.443154839
Kurtosis33.68297118
Mean401.22972
Median Absolute Deviation (MAD)0
Skewness5.188251528
Sum859835.29
Variance1908531.342
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0172880.6%
 
20281.3%
 
4070.3%
 
1219.3820.1%
 
2520.1%
 
8020.1%
 
1710.211< 0.1%
 
207.91< 0.1%
 
396.121< 0.1%
 
385.021< 0.1%
 
Other values (370)37017.3%
 
ValueCountFrequency (%) 
0172880.6%
 
3.711< 0.1%
 
4.221< 0.1%
 
4.421< 0.1%
 
4.541< 0.1%
 
ValueCountFrequency (%) 
15370.451< 0.1%
 
15131.221< 0.1%
 
12473.741< 0.1%
 
11935.041< 0.1%
 
11788.91< 0.1%
 

FAIXA_DE_DIVIDA
Categorical

Distinct count7
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
SEM DIVIDA
1943
> 1500
 
120
1000-1500
 
26
750-1000
 
15
500-750
 
14
Other values (2)
 
25
ValueCountFrequency (%) 
SEM DIVIDA194390.7%
 
> 15001205.6%
 
1000-1500261.2%
 
750-1000150.7%
 
500-750140.7%
 
100-300140.7%
 
300-500110.5%
 

Length

Max length10
Median length10
Mean length9.695286981
Min length6
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2007
1
 
136
ValueCountFrequency (%) 
0200793.7%
 
11366.3%
 

CR_PER_ANT
Real number (ℝ≥0)

ZEROS

Distinct count511
Unique (%)23.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5674381707886145
Minimum0.0
Maximum10.0
Zeros138
Zeros (%)6.4%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q16.175
median7.09
Q37.885
95-th percentile8.889
Maximum10
Range10
Interquartile range (IQR)1.71

Descriptive statistics

Standard deviation2.206717911
Coefficient of variation (CV)0.3360089358
Kurtosis2.907353729
Mean6.567438171
Median Absolute Deviation (MAD)0.84
Skewness-1.786154561
Sum14074.02
Variance4.869603938
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01386.4%
 
7351.6%
 
6261.2%
 
6.5160.7%
 
7.5160.7%
 
8150.7%
 
7.34130.6%
 
7.43120.6%
 
7.15120.6%
 
6.6120.6%
 
Other values (501)184886.2%
 
ValueCountFrequency (%) 
01386.4%
 
0.191< 0.1%
 
0.371< 0.1%
 
0.51< 0.1%
 
0.620.1%
 
ValueCountFrequency (%) 
1060.3%
 
9.8920.1%
 
9.81< 0.1%
 
9.7520.1%
 
9.711< 0.1%
 

FAIXA_APROVACAO
Categorical

Distinct count7
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
7 - 100%
1284
5 - 61% A 80%
275
1 - 0%
 
181
4 - 41% A 60%
 
140
6 - 81% A 99%
 
136
Other values (2)
 
127
ValueCountFrequency (%) 
7 - 100%128459.9%
 
5 - 61% A 80%27512.8%
 
1 - 0%1818.4%
 
4 - 41% A 60%1406.5%
 
6 - 81% A 99%1366.3%
 
3 - 21% A 40%1014.7%
 
2 - 1% A 20%261.2%
 

Length

Max length13
Median length8
Mean length9.400839944
Min length6

QTD_ACESSOS_19_2
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count272
Unique (%)12.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean87.53896406906206
Minimum0.0
Maximum506.0
Zeros5
Zeros (%)0.2%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile19
Q147
median74
Q3111
95-th percentile199
Maximum506
Range506
Interquartile range (IQR)64

Descriptive statistics

Standard deviation60.4689675
Coefficient of variation (CV)0.6907663135
Kurtosis5.281469482
Mean87.53896407
Median Absolute Deviation (MAD)31
Skewness1.813572981
Sum187596
Variance3656.49603
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
71291.4%
 
50251.2%
 
52241.1%
 
65241.1%
 
35231.1%
 
86231.1%
 
49231.1%
 
68221.0%
 
51221.0%
 
69221.0%
 
Other values (262)190688.9%
 
ValueCountFrequency (%) 
050.2%
 
130.1%
 
220.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
5061< 0.1%
 
4441< 0.1%
 
4191< 0.1%
 
4161< 0.1%
 
4101< 0.1%
 

QTD_ACESSOS_20_1
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count251
Unique (%)11.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean77.90153989734017
Minimum0.0
Maximum561.0
Zeros15
Zeros (%)0.7%
Memory size16.7 KiB

Quantile statistics

Minimum0
5-th percentile12
Q141
median66
Q3102
95-th percentile181
Maximum561
Range561
Interquartile range (IQR)61

Descriptive statistics

Standard deviation55.4809976
Coefficient of variation (CV)0.7121938497
Kurtosis7.254037931
Mean77.9015399
Median Absolute Deviation (MAD)29
Skewness1.908805566
Sum166943
Variance3078.141095
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
41301.4%
 
48291.4%
 
53281.3%
 
50271.3%
 
58271.3%
 
39271.3%
 
51251.2%
 
32251.2%
 
47251.2%
 
59241.1%
 
Other values (241)187687.5%
 
ValueCountFrequency (%) 
0150.7%
 
190.4%
 
230.1%
 
360.3%
 
450.2%
 
ValueCountFrequency (%) 
5611< 0.1%
 
4591< 0.1%
 
4261< 0.1%
 
4231< 0.1%
 
4031< 0.1%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2032
1
 
111
ValueCountFrequency (%) 
0203294.8%
 
11115.2%
 
Distinct count5
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
4 - Baixo
1133
3 - Médio Baixo
650
2 - Médio Alto
 
168
1 - Alto
 
160
00 - Não Escorado
 
32
ValueCountFrequency (%) 
4 - Baixo113352.9%
 
3 - Médio Baixo65030.3%
 
2 - Médio Alto1687.8%
 
1 - Alto1607.5%
 
00 - Não Escorado321.5%
 

Length

Max length17
Median length9
Mean length11.25664956
Min length8
Distinct count5
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
3 - Médio Baixo
1290
2 - Médio Alto
422
4 - Baixo
353
1 - Alto
 
46
00 - Não Escorado
 
32
ValueCountFrequency (%) 
3 - Médio Baixo129060.2%
 
2 - Médio Alto42219.7%
 
4 - Baixo35316.5%
 
1 - Alto462.1%
 
00 - Não Escorado321.5%
 

Length

Max length17
Median length15
Mean length13.69435371
Min length8
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
1
2048
0
 
95
ValueCountFrequency (%) 
1204895.6%
 
0954.4%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2142
1
 
1
ValueCountFrequency (%) 
02142> 99.9%
 
11< 0.1%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
0
2080
1
 
63
ValueCountFrequency (%) 
0208097.1%
 
1632.9%
 

SAFRA
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size16.7 KiB
VETERANO
1917
CALOURO
 
226
ValueCountFrequency (%) 
VETERANO191789.5%
 
CALOURO22610.5%
 

Length

Max length8
Median length8
Mean length7.894540364
Min length7

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

COD_MATRICULAALUNO_MATRICULADO_PROX_PERIODOALUNO_MATRICULADOCONDICAO_GERENCIALCONDICAO_IMPEDE_RENOVACAOTIPO_INADIMPLENCIARISCO_INADIMPLENCIACOD_CURSOCOD_TURNOCOD_TIPO_CURSOCLASSE_PAGANTE_ANTCLASSE_PAGANTE_ATUNOVO_FIESPRV_ANTPRV_ATULATE_COMERNOM_SITUACAO_ALUNONOM_SITUACAO_PERIODO_ANTVAL_A_PAGARVAL_A_PAGAR_PARVAL_DIVIDA_MENSVAL_DIVIDA_TOTALFAIXA_DE_DIVIDAADIMP_N_RENCR_PER_ANTFAIXA_APROVACAOQTD_ACESSOS_19_2QTD_ACESSOS_20_1IND_INDICIO_EVASAOCLASSIFICACAO_PROP_EVASAOCLASSIFICACAO_PROP_RENOVACAOACEITE_CONTRATOPASTA_VERMELHAREQ_AGEND_TRANCSAFRA
0200701339023009 - ALTO PROPENSOBOLETO RENOVACAO NAO PAGOADIMPLENTESEM RISCO1311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado51.180.00.000.00SEM DIVIDA19.007 - 100%32.021.001 - Alto3 - Médio Baixo000VETERANO
1200702198245008 - BAIXO PROPENSOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOMEDIO RISCO4004311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.03111.203111.20> 150005.914 - 41% A 60%48.048.003 - Médio Baixo4 - Baixo100CALOURO
2200702198679008 - BAIXO PROPENSOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOMEDIO RISCO29311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.03238.773238.77> 150005.273 - 21% A 40%53.046.004 - Baixo4 - Baixo000VETERANO
3200801431011111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTESEM RISCO29311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado2521.120.00.000.00SEM DIVIDA09.897 - 100%130.0119.004 - Baixo2 - Médio Alto100VETERANO
4200901402225111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEBAIXO RISCO1311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado170.960.00.000.00SEM DIVIDA03.504 - 41% A 60%125.070.0000 - Não Escorado00 - Não Escorado100VETERANO
5201001152832111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEBAIXO RISCO2311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado663.660.00.000.00SEM DIVIDA06.007 - 100%87.076.004 - Baixo3 - Médio Baixo100VETERANO
6201001153693008 - BAIXO PROPENSOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOBAIXO RISCO1311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.01013.561013.561000-150007.757 - 100%11.09.003 - Médio Baixo4 - Baixo100CALOURO
7201001211669111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTESEM RISCO2311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado883.800.00.000.00SEM DIVIDA06.805 - 61% A 80%158.082.002 - Médio Alto3 - Médio Baixo100VETERANO
8201001462891111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTESEM RISCO166311PROUNIPROUNINÃO00ON TIME COMERAtivoCursado0.000.00.000.00SEM DIVIDA08.487 - 100%102.0111.004 - Baixo2 - Médio Alto100VETERANO
9201002224454008 - BAIXO PROPENSOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOMEDIO RISCO2311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.04523.004523.00> 150005.803 - 21% A 40%83.052.004 - Baixo4 - Baixo100VETERANO

Last rows

COD_MATRICULAALUNO_MATRICULADO_PROX_PERIODOALUNO_MATRICULADOCONDICAO_GERENCIALCONDICAO_IMPEDE_RENOVACAOTIPO_INADIMPLENCIARISCO_INADIMPLENCIACOD_CURSOCOD_TURNOCOD_TIPO_CURSOCLASSE_PAGANTE_ANTCLASSE_PAGANTE_ATUNOVO_FIESPRV_ANTPRV_ATULATE_COMERNOM_SITUACAO_ALUNONOM_SITUACAO_PERIODO_ANTVAL_A_PAGARVAL_A_PAGAR_PARVAL_DIVIDA_MENSVAL_DIVIDA_TOTALFAIXA_DE_DIVIDAADIMP_N_RENCR_PER_ANTFAIXA_APROVACAOQTD_ACESSOS_19_2QTD_ACESSOS_20_1IND_INDICIO_EVASAOCLASSIFICACAO_PROP_EVASAOCLASSIFICACAO_PROP_RENOVACAOACEITE_CONTRATOPASTA_VERMELHAREQ_AGEND_TRANCSAFRA
2133201909215139107- BAIXO APROVEITAMENTOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOALTO RISCO1311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.003562.193562.19> 150000.001 - 0%1.01.003 - Médio Baixo4 - Baixo010CALOURO
2134201909245641111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEALTO RISCO1311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado55.130.000.000.00SEM DIVIDA00.001 - 0%53.067.003 - Médio Baixo2 - Médio Alto100CALOURO
2135201912030799007- BAIXO APROVEITAMENTOINADIMPLENTEINADIMPLENTE SEM NEGOCIACAOBAIXO RISCO166311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado0.000.00419.09419.09300-50004.841 - 0%164.071.004 - Baixo3 - Médio Baixo100VETERANO
2136201912030811111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEALTO RISCO407034MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado567.130.000.000.00SEM DIVIDA07.897 - 100%154.092.002 - Médio Alto3 - Médio Baixo100VETERANO
2137201912030829111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEBAIXO RISCO166311PARPARNÃO00ON TIME COMERAtivoCursado1200.00480.000.000.00SEM DIVIDA07.107 - 100%125.097.002 - Médio Alto3 - Médio Baixo100VETERANO
2138201912030837111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEBAIXO RISCO4004311MENSALISTAMENSALISTANÃO00ON TIME COMERAtivoCursado534.380.000.000.00SEM DIVIDA07.547 - 100%143.0133.004 - Baixo3 - Médio Baixo100VETERANO
2139201912030853111 - ALUNO RENOVADOALUNO RENOVADOADIMPLENTEBAIXO RISCO4004311PARPARNÃO00ON TIME COMERAtivoCursado1157.485.160.005.16SEM DIVIDA06.315 - 61% A 80%64.042.002 - Médio Alto3 - Médio Baixo100VETERANO
2140201912030861002 - ALUNO EVADIDOALUNO EVADIDO SEM RENOVARADIMPLENTEMEDIO RISCO2311PARPARNÃO00ON TIME COMERTrancadoCursado0.004.880.000.00SEM DIVIDA07.437 - 100%154.0109.013 - Médio Baixo3 - Médio Baixo100VETERANO
2141201912030934111 - ALUNO RENOVADOALUNO RENOVADOINADIMPLENTE COM NEGOCIACAOBAIXO RISCO29311PARPARNÃO00ON TIME COMERAtivoCursado1486.82606.040.00489.79SEM DIVIDA06.147 - 100%154.0147.004 - Baixo3 - Médio Baixo100VETERANO
2142201912030951111 - ALUNO RENOVADOALUNO RENOVADOINADIMPLENTE COM NEGOCIACAOMEDIO RISCO29311PARPARNÃO00ON TIME COMERAtivoCursado1399.365.130.00844.74SEM DIVIDA04.262 - 1% A 20%57.065.002 - Médio Alto3 - Médio Baixo100VETERANO